(1) GENERAL INFORMATION:

(i) APPLICANT: Sato, Takaaki

(ii) TITLE OF INVENTION: TREX, A NOVEL GENE OF TRAF-INTERACTING
EXT GENE FAMILY AND DIAGNOSTIC AND THERAPEUTIC USES
THEREOF

(iii) NUMBER OF SEQUENCES: 37

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Cooper & Dunham LLP

(B) STREET: 1185 Avenue of the Americas

(C) CITY: New York

(D) STATE: New York

(E) COUNTRY: U.S.A.

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: White, John P.

(B) REGISTRATION NUMBER: 28,678

(C) REFERENCE/DOCKET NUMBER: 0575/51902-A-PCT

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (212) 278-0400

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3479 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 458..3211



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCTGATCGTT GGTAGTGGCA TGGAGGACGG GGCTGGCATT TCAGACTGCC AGCTGTTTTT 60

ACCAGCCGCT GCATCACTTG AATAGAAGCT ATGCATATTG GCTGGCCGAC AAAGCCAAGG 120

GACAAAAGCT ATGGCCGTTA AAATGGTCCC TCTGAGTCCA GGGCTCTTTC CCTGGCTTTT 180

AGCACCATGG ATCTCTTCCT TTTCATCCCA TCAGCAATGT GGTACCTTCT TCTACTTGAT 240

GATGACAGCT GATACTTCAG ATTTGCCTGA CTAAGGTTAG AAACCTGAAT CGCTGTGAGG 300

AAGATGAAAT TTCCATTTTA CTTGGTGCCT TGTGCAGGGA GCACACTGAT CCTTCCAGAA 360

ACTTGTGTGT GAAAAGAGGT TGCGTTTTGT CAGACAGACT CATGGTTATG GCGAGCGATC 420

CGACGTGATC AGAGTGGGCA AGAGGCACAG CGAACTC ATG ACA GGC TAT ACC ATG 475

Met Thr Gly Tyr Thr Met 
1 5 

TTG CGG AAT GGG GGA GTG GGG AAC GGT GGT CAG ACC TGT ATG CTG CGC 523 
Leu Arg Asn Gly Gly Val Gly Asn Gly Gly Gln Thr Cys Met Leu Arg
10 15 20

TGG TCC AAT CGC ATC CGG CTG ACA TGG CTG AGT TTC ACG CTG TTC ATC 571
Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu Ser Phe Thr Leu Phe Ile
25 30 35

ATC CTC GTC TTC TTC CCC CTC ATT GCT CAC TAT TAC CTC ACC ACT CTG 619 
Ile Leu Val Phe Phe Pro Leu Ile Ala His Tyr Tyr Leu Thr Thr Leu
40 45 50

GAC GAG GCA GAC GAG GCT GGC AAG CGC ATC TTC GGC CCT CGG GCT GGC 667
Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile Phe Gly Pro Arg Ala Gly
55 60 65 70 

AGT GAG CTC TGT GAG GTA AAG CAT GTC CTT GAT CTC TGT CGG ATT CGT 715
Ser Glu Leu Cys Glu Val Lys His Val Leu Asp Leu Cys Arg Ile Arg
75 80 85

GAG TCT GTG AGC GAA GAG CTT CTA CAG CTC GAA GCC AAG CGG CAG GAG 763
Glu Ser Val Ser Glu Glu Leu Leu Gln Leu Glu Ala Lys Arg Gln Glu
90 95 100

CTG AAC AGC GAG ATT GCC AAG CTG AAC CTC AAG ATT GAA GCC TGT AAG 811
Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu Lys Ile Glu Ala Cys Lys
105 110 115

AAG AGC ATA GAG AAT GCC AAG CAG GAC CTG CTG CAG CTC AAG AAT GTC 859 

Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu Leu Gln Leu Lys Asn Val
120 125 130

ATT AGC CAG ACA GAG CAC TCC TAC AAG GAG CTG ATG GCC CAG AAC CAG 907
Ile Ser Gln Thr Glu His Ser Tyr Lys Glu Leu Met Ala Gln Asn Gln
135 140 145 150

CCC AAA CTG TCC CTG CCC ATC CGA CTG CTC CCT GAG AAG GAC GAT GCC 955
Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu Pro Glu Lys Asp Asp Ala
155 160 165

GGC CTT CCA CCC CCC AAG GTC ACT CGG GGT TGC CGC CTT CAC AAC TGC 1003
Gly Leu Pro Pro Pro Lys Val Thr Arg Gly Cys Arg Leu His Asn Cys
170 175 180

TTT GAT TAC TCT CGT TGT CCT CTG ACG TCT GGC TTT CCC GTC TAC GTC 1051
Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly Phe Pro Val Tyr Val
185 190 195 

TAT GAC AGT GAC CAG TTT GCC TTT GGG AGC TAC CTG GAC CCT TTG GTC 
Tyr Asp Ser Asp Gln Phe Ala Phe Gly Ser Tyr Leu Asp Pro Leu Val
200 205 210

AAG CAG GCT TTT CAG GCT ACA GTG AGA GCC AAC GTT TAT GTT ACA GAA
Lys Gln Ala Phe Gln Ala Thr Val Arg Ala Asn Val Tyr Val Thr Glu
215 220 225 230

AAT GCG GCC ATC GCC TGC CTG TAT GTG GTG TTA GTG GGA GAA ATG CAA 

Asn Ala Ala Ile Ala Cys Leu Tyr Val Val Leu Val Gly Glu Met Gln
235 240 245

GAG CCC ACT GTG CTG CGG CCT GCC GAC CTT GAA AAG CAG CTG TTT TCT
Glu Pro Thr Val Leu Arg Pro Ala Asp Leu Glu Lys Gln Leu Phe Ser
250 255 260

CTG CCA CAC TGG AGG ACA GAT GGG CAC AAC CAC GTC ATT ATC AAC CTG
Leu Pro His Trp Arg Thr Asp Gly His Asn His Val Ile Ile Asn Leu
265 270 275

TCC CGG AAG TCA GAC ACA CAG AAT CTA CTG TAC AAC GTC AGT ACA GGC
Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu Tyr Asn Val Ser Thr Gly
280 285 290

CGC CAT GTG GCC CAG TCC ACC CTC TAT GCT GCC CAG TAC AGA GCT GGC
Arg His Val Ala Gln Ser Thr Leu Tyr Ala Ala Gln Tyr Arg Ala Gly
295 300 305 310

TTT GAC CTG GTC GTG TCA CCC CTT GTC CAT GCT ATG TCT GAA CCC AAC 

Phe Asp Leu Val Val Ser Pro Leu Val His Ala Met Ser Glu Pro Asn 

315 320 325 

TTC ATG GAA ATC CCA CCG CAG GTG CCA GTT AAG CGG AAA TAT CTC TTC
Phe Met Glu Ile Pro Pro Gln Val Pro Val Lys Arg Lys Tyr Leu Phe
330 335 340

ACT TTC CAG GGC GAG AAG ATC GAG TCT CTG AGA TCT AGC CTT CAG GAG
Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu Arg Ser Ser Leu Gln Glu
345 350 355

GCC CGT TCC TTC GAG GAA GAG ATG GAG GGC GAC CCT CCG GCC GAC TAT
Ala Arg Ser Phe Glu Glu Glu Met Glu Gly Asp Pro Pro Ala Asp Tyr
360 365 370

GAC GAT CGC ATC ATT GCC ACC CTA AAG GCT GTA CAG GAC AGC AAG CTG
Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala Val Gln Asp Ser Lys Leu
375 380 385 390

GAT CAG GTG CTG GTA GAA TTC ACT TGC AAA AAC CAG CCG AAG CCT AGC 

Asp Gln Val Leu Val Glu Phe Thr Cys Lys Asn Gln Pro Lys Pro Ser
395 400 405

CTG CCG ACT GAG TGG GCA CTG TGT GGG GAG CGG GAA GAC CGC CTG GAG
Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu
410 415 420

TTA CTG AAG CTC TCC ACC TTC GCC CTC ATC ATC ACT CCC GGG GAC CCG
Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile Ile Thr Pro Gly Asp Pro
425 430 435

CGC CTG CTC ATT TCA TCT GGG TGT GCC ACG CGG CTC TTC GAG GCC CTG
Arg Leu Leu Ile Ser Ser Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu
440 445 450

GAG GTG GGG GCC GTG CCG GTG GTG CTC GGG GAG CAG GTG CAG CTC CCG
Glu Val Gly Ala Val Pro Val Val Leu Gly Glu Gln Val Gln Leu Pro
455 460 465 470

TAC CAC GAC ATG CTG CAG TGG AAC GAG GCC GCC CTG GTG GTG CCC AAG 
Tyr His Asp Met Leu Gln Trp Asn Glu Ala Ala Leu Val Val Pro Lys
475 480 485

CCT CGC GTC ACA GAG GTC CAC TTC CTG TTA CGA AGT CTT TCA GAC AGT
Pro Arg Val Thr Glu Val His Phe Leu Leu Arg Ser Leu Ser Asp Ser
490 495 500

GAT CTG TTG GCC ATG AGG CGG CAA GGC CGC TTT CTC TGG GAG ACC TAC
Asp Leu Leu Ala Met Arg Arg Gln Gly Arg Phe Leu Trp Glu Thr Tyr
505 510 515

TTC TCC ACC GCA GAC AGT ATT TTT AAT ACC GTG CTG GCC ATG ATT AGG
Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr Val Leu Ala Met Ile Arg
520 525 530

ACT CGA ATT CAG ATC CCA GCT GCT CCC ATC CGG GAA GAG GTA GCG GCT
Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile Arg Glu Glu Val Ala Ala
535 540 545 550

GAG ATC CCC CAT CGT TCA GGC AAA GCA GCT GGA ACT GAC CCC AAC ATG
Glu Ile Pro His Arg Ser Gly Lys Ala Ala Gly Thr Asp Pro Asn Met
555 560 565

GCT GAC AAT GGG GAC CTG GAC CTG GGG CCG GTA GAG ACA GAA CCA CCC
Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro Val Glu Thr Glu Pro Pro
570 575 580

TAT GCC TCA CCT AAA TAC CTC CGC AAT TTC ACT CTG ACT GTC ACA GAC 

Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe Thr Leu Thr Val Thr Asp 
585 590 595 

TGT TAC CGT GGC TGG AAC TCT GCC CCG GGA CGG TTC CAT CTT TTT CCC
Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly Arg Phe His Leu Phe Pro
600 605 610

CAC ACA CCC TTT GAT CCT GTG TTG CCC TCT GAG GCC AAA TTC TTG GGC
His Thr Pro Phe Asp Pro Val Leu Pro Ser Glu Ala Lys Phe Leu Gly
615 620 625 630

TCA GGG ACT GGA TTT CGG CCG ATC GGT GGC GGG GCT GGG GGC TCT GGC
Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly Gly Ala Gly Gly Ser Gly
635 640 645

AAG GAG TTC CAG GCA GCG CTC GGA GGC AAT GTC CAG CGG GAG CAG TTC
Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn Val Gln Arg Glu Gln Phe
650 655 660

ACA GTT GTG ATG CTG ACC TAC GAG CGG GAG GAA GTG CTC ATG AAC TCC 

Thr Val Val Met Leu Thr Tyr Glu Arg Glu Glu Val Leu Met Asn Ser 
665 670 675 

CTG GAG AGA CTC AAC GGC CTC CCC TAC CTG AAC AAG GTA GTG GTG GTG
Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu Asn Lys Val Val Val Val
680 685 690

TGG AAC TCT CCC AAG CTG CCC TCG GAG GAC CTT TTG TGG CCA GAC ATT 2587
Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp Leu Leu Trp Pro Asp Ile
695 700 705 710

GGT GTC CCC ATC ATG GTC GTC CGT ACT GAG AAG AAC AGT TTG AAC AAT 2635
Gly Val Pro Ile Met Val Val Arg Thr Glu Lys Asn Ser Leu Asn Asn
715 720 725

CGG TTC TTG CCC TGG AAT GAG ATT GAG ACA GAG GCC ATA CTG TCC ATC 2683
Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr Glu Ala Ile Leu Ser Ile
730 735 740

GAC GAT GAT GCT CAC CTC CGC CAT GAT GAA ATC ATG TTT GGG TTT TGG 2731 
Asp Asp Asp Ala His Leu Arg His Asp Glu Ile Met Phe Gly Phe Trp
745 750 755

GTG TGG AGA GAA GCA CGT GAT CGC ATT GTG GGT TTC CCT GGC CGG TAC 2779
Val Trp Arg Glu Ala Arg Asp Arg Ile Val Gly Phe Pro Gly Arg Tyr
760 765 770

CAT GCG TGG GAC ATC CCG CAC CAG TCC TGG CTC TAC AAT TCC AAC TAC 2827
His Ala Trp Asp Ile Pro His Gln Ser Trp Leu Tyr Asn Ser Asn Tyr
775 780 785 790

TCC TGT GAG CTG TCC ATG GTG CTG ACG GGC GCT GCC TTC TTT CAC AAG 2875
Ser Cys Glu Leu Ser Met Val Leu Thr Gly Ala Ala Phe Phe His Lys 
795 800 805 

TAT TAT GCC TAC CTG TAT TCT TAT GTG ATG CCC CAG GCC ATC CGG GAC 2923 

Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met Pro Gln Ala Ile Arg Asp
810 815 820

ATG GTG GAC GAG TAC ATC AAC TGT GAG GAT ATC GCC ATG AAC TTC CTT 2971
Met Val Asp Glu Tyr Ile Asn Cys Glu Asp Ile Ala Met Asn Phe Leu
825 830 835

GTC TCC CAC ATC ACA CGG AAA CCC CCC ATC AAG GTG ACA TCA AGG TGG 3019
Val Ser His Ile Thr Arg Lys Pro Pro Ile Lys Val Thr Ser Arg Trp
840 845 850

ACT TTT CGA TGC CCA GGG TGC CCT CAG GCC CTG TCC CAT GAT GAC TCT 3067
Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala Leu Ser His Asp Asp Ser
855 860 865 870

CAT TTT CAC GAG CGG CAC AAG TGT ATC AAC TTT TTT GTC AAG GTG TAC 3115
His Phe His Glu Arg His Lys Cys Ile Asn Phe Phe Val Lys Val Tyr
875 880 885

GGC TAT ATG CCT CTC TTG TAC ACA CAG TTC AGG GTG GAC TCC GTG CTC 3163
Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe Arg Val Asp Ser Val Leu
890 895 900

TTC AAG ACC CGC CTG CCC CAT GAC AAG ACC AAG TGC TTC AAG TTC ATC 3211
Phe Lys Thr Arg Leu Pro His Asp Lys Thr Lys Cys Phe Lys Phe Ile
905 910 915

TAGGGCCTTG CAGTTCTGAG GAGACAATGA GCAGAGCGAG GGGGAGTCAC CCTCAAGGTT 3271

CCCAAGGTGT CGAAGGTCCT TGGGGACATC TGTCGGGCAG GGCCAAGACC CTTTGCTGGG 3331

AGAGGCAGCA GGAAGAGTGG AAAGGGATAG CTGTCTTTCA TTTTGAAGTC AGCCACACTG 3391

GGCCTGGGAT CCTGGTCAGA GACTCAGGNC GTCTGCACAG GGCACTGACT GATAGCGAAC 3451

ACTGAGGACT GTTCATAAGC CCAGGACA

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Val Gly Asn Gly Gly
1 5 10 15 

Gln Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu
20 25 30

Ser Phe Thr Leu Phe Ile Ile Leu Val Phe Phe Pro Leu Ile Ala His
35 40 45

Tyr Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile
50 55 60

Phe Gly Pro Arg Ala Gly Ser Glu Leu Cys Glu Val Lys His Val Leu
65 70 75 80

Asp Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu
85 90 95

Glu Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu
100 105 110

Lys Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu
115 120 125

Leu Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu
130 135 140

Leu Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu
145 150 155 160

Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Val Thr Arg Gly
165 170 175

Cys Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser
180 185 190

Gly Phe Pro Val Tyr Val Tyr Asp Ser Asp Gln Phe Ala Phe Gly Ser
195 200 205

Tyr Leu Asp Pro Leu Val Lys Gln Ala Phe Gln Ala Thr Val Arg Ala
210 215 220

Asn Val Tyr Val Thr Glu Asn Ala Ala Ile Ala Cys Leu Tyr Val Val
225 230 235 240

Leu Val Gly Glu Met Gln Glu Pro Thr Val Leu Arg Pro Ala Asp Leu
245 250 255

Glu Lys Gln Leu Phe Ser Leu Pro His Trp Arg Thr Asp Gly His Asn
260 265 270

His Val Ile Ile Asn Leu Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu
275 280 285

Tyr Asn Val Ser Thr Gly Arg His Val Ala Gln Ser Thr Leu Tyr Ala
290 295 300

Ala Gln Tyr Arg Ala Gly Phe Asp Leu Val Val Ser Pro Leu Val His
305 310 315 320

Ala Met Ser Glu Pro Asn Phe Met Glu Ile Pro Pro Gln Val Pro Val
325 330 335

Lys Arg Lys Tyr Leu Phe Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu
340 345 350

Arg Ser Ser Leu Gln Glu Ala Arg Ser Phe Glu Glu Glu Met Glu Gly
355 360 365

Asp Pro Pro Ala Asp Tyr Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala
370 375 380

Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys Lys
385 390 395 400

Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu
405 410 415

Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile
420 425 430

Ile Thr Pro Gly Asp Pro Arg Leu Leu Ile Ser Ser Gly Cys Ala Thr
435 440 445

Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu Gly
450 455 460

Glu Gln Val Gln Leu Pro Tyr His Asp Met Leu Gln Trp Asn Glu Ala
465 470 475 480

Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu Leu
485 490 495

Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly Arg
500 505 510

Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr
515 520 525

Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile
530 535 540

Arg Glu Glu Val Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala Ala
545 550 555 560

Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro
565 570 575

Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Lys Tyr Leu Arg Asn Phe
580 585 590

Thr Leu Thr Val Thr Asp Cys Tyr Arg Gly Trp Asn Ser Ala Pro Gly

Arg Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro Ser
610 615 620 

Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly
625 630 635 640

Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn
645 650 655

Val Gln Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
660 665 670

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
675 680 685

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
690 695 700

Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
705 710 715 720

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
725 730 735

Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
740 745 750

Ile Met Phe Gly Phe Trp Val Trp Arg Glu Ala Arg Asp Arg Ile Val
755 760 765

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
770 775 780

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
785 790 795 800

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
805 810 815

Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
820 825 830

Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
835 840 845

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
850 855 860

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
865 870 875 880

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
885 890 895

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
900 905 910

Lys Cys Phe Lys Phe Ile
915

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 594..3350

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGCGGGTCCC TGAGCTGGAA GCCGGAGAGC AAGCCCTGGA GGTTCACTCT TTCAAGAAGT 60 

CGTGTGCTGA GGTGTAATGC TACACAAGTC AGAGGAAGGA AGGGTCCTGA AACACATGGC 120 

CTGATTGTTG GCAAAGGCAT CATAAGAAGC TGGCATTTAT TTCTGTTCTA ACCTATTACT 180

GTATAACTGT GAATAGACAC TATGCATATT TGTTGGTCAG CAAAACCAAG AAACAAGAGC 240

TATGGCATTT GAAAAAGTCT GTCTGATTCC AGGGTGTTTT TCCTGGGTTT CATCATCAGG 300

TACCTCCTCC CTTTCATCTC AGCAAGAATG TGGCACCTTT TATCGTTTGA TAAAGATTAA 360

GGACATGTTC TTTGGTCAAC AGCCAGAACT TAAAATCTGC TGGAATAGGG TCAGAGACCA 420

TTTCAGCTGC AGCTGAGGAA AATGAAATGT TCATTTTATT TGGTGCCTTG TCTGGGGAGC 480

ACACTAACTC TTCTGGAAAC GTGTCAGTGA AACAGAGATC GTTTTGTGGA ATAGCAACCC 540

ATGGTTATGG CGAGTGACCC GACGTGATCT GGGGGGCAGG CTGCAGAGGA CTC ATG 596

Met 

ACA GGC TAT ACC ATG CTG CGG AAT GGG GGC GCG GGG AAC GGA GGT CAG 644 
Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly Gln
920 925 930 935

ACC TGC ATG CTG CGC TGG TCC AAC CGC ATC CGC CTC ACG TGG CTC AGC 692
Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu Ser
940 945 950

TTC ACG CTC TTT GTC ATC CTG GTC TTC TTC CCG CTC ATC GCC CAC TAT 740
Phe Thr Leu Phe Val Ile Leu Val Phe Phe Pro Leu Ile Ala His Tyr
955 960 965

TAC CTC ACC ACT CTG GAT GAG GCT GAT GAG GCA GGC AAG CGG ATT TTT 788
Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile Phe
970 975 980

GGT CCC CGG GTG GGG AAC GAG CTG TGC GAG GTG AAG CAC GTG CTG GAT 836
Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu Asp
985 990 995

CTG TGC CGC ATC CGG GAG TCG GTG AGT GAA GAG CTC CTG CAG CTG GAG 884
Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu Glu
1000 1005 1010 1015

GCC AAG CGC CAA GAG CTG AAC AGC GAG ATC GCC AAG CTG AAT CTG AAG 932
Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu Lys
1020 1025 1030

ATC GAA GCC TGT AAG AAG AGC ATT GAG AAC GCC AAG CAG GAC CTG CTC 
Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu Leu
1035 1040 1045

CAG CTC AAG AAT GTC ATC AGC CAG ACC GAG CAT TCC TAC AAG GAG CTC
Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu Leu
1050 1055 1060

ATG GCC CAG AAC CAG CCC AAG CTG TCC CTG CCC ATC CGA CTG CTC CCA
Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu Pro
1065 1070 1075

GAG AAG GAC GAT GCC GGC CTC CCT CCC CCG AAG GCC ACT CGG GGC TGC
Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Gly Cys
1080 1085 1090 1095

CGG CTA CAC AAC TGC TTT GAT TAT TCT CGT TGC CCT CTC ACC TCT GGC
Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser Gly
1100 1105 1110

TTC CCG GTC TAC GTC TAT GAC AGT GAC CAG TTT GTC TTT GGC AGC TAC
Phe Pro Val Tyr Val Tyr Asp Ser Asp Gln Phe Val Phe Gly Ser Tyr
1115 1120 1125

CTG GAT CCC TTG GTC AAG CAG GCT TTT CAG GCG ACA GCA CGA GCT AAC
Leu Asp Pro Leu Val Lys Gln Ala Phe Gln Ala Thr Ala Arg Ala Asn
1130 1135 1140

GTT TAT GTT ACA GAA AAT GCA GAC ATC GCC TGC CTT TAC GTG ATA CTA
Val Tyr Val Thr Glu Asn Ala Asp Ile Ala Cys Leu Tyr Val Ile Leu
1145 1150 1155 

GTG GGA GAG ATG CAG GAG CCC GTG GTG CTG CGG CCT GCT GAG CTG GAG
Val Gly Glu Met Gln Glu Pro Val Val Leu Arg Pro Ala Glu Leu Glu
1160 1165 1170 1175

AAG CAG TTG TAT TCC CTG CCA CAC TGG CGG ACG GAT GGA CAC AAC CAT
Lys Gln Leu Tyr Ser Leu Pro His Trp Arg Thr Asp Gly His Asn His
1180 1185 1190

GTC ATC ATC AAT CTG TCA CGT AAG TCA GAT ACA CAG AAC CTT CTC TAT
Val Ile Ile Asn Leu Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu Tyr
1195 1200 1205

AAC GTC AGT ACT GGC CGT GCC ATG GTG GCC CAG TCC ACC TTC TAC ACT
Asn Val Ser Thr Gly Arg Ala Met Val Ala Gln Ser Thr Phe Tyr Thr
1210 1215 1220

GTC CAG TAC AGA CCT GGC TTT GAC TTG GTC GTA TCA CCG CTG GTC CAT
Val Gln Tyr Arg Pro Gly Phe Asp Leu Val Val Ser Pro Leu Val His
1225 1230 1235

GCC ATG TCT GAG CCC AAC TTC ATG GAA ATC CCA CCA CAG GTG CCG GTG
Ala Met Ser Glu Pro Asn Phe Met Glu Ile Pro Pro Gln Val Pro Val
1240 1245 1250 1255

AAG CGG AAA TAT CTC TTC ACC TTC CAG GGC GAG AAG ATT GAG TCT CTG
Lys Arg Lys Tyr Leu Phe Thr Phe Gln Gly Glu Lys Ile Glu Ser Leu

AGG TCT AGC CTT CAG GAG GCC CGC TCC TTC GAA GAG GAA ATG GAG GGC
Arg Ser Ser Leu Gln Glu Ala Arg Ser Phe Glu Glu Glu Met Glu Gly
1275 1280 1285

GAC CCT CCC GCC GAC TAC GAT GAC CGG ATC ATT GCC ACC CTG AAG GCG
Asp Pro Pro Ala Asp Tyr Asp Asp Arg Ile Ile Ala Thr Leu Lys Ala
1290 1295 1300

GTG CAG GAC AGC AAG CTG GAT CAG GTC CTG GTG GAA TTC ACC TGC AAA
Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys Lys
1305 1310 1315

AAC CAG CCC AAA CCC AGC CTG CCG ACT GAG TGG GCA CTG TGT GGA GAG
Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly Glu
1320 1325 1330 1335

CGG GAG GAC CGC TTG GAA TTG CTG AAG CTC TCC ACC TTC GCC CTC ATC 

Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu Ile
1340 1345 1350

ATT ACC CCC GGG GAC CCT CGC TTG GTT ATT TCC TCT GGG TGT GCA ACA
Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser Gly Cys Ala Thr
1355 1360 1365

CGG CTC TTC GAA GCC CTG GAA GTC GGT GCC GTC CCG GTG GTG CTG GGG
Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu Gly
1370 1375 1380

GAG CAG GTC CAG CTT CCC TAC CAG GAC ATG CTG CAG TGG AAC GAG GCG
Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln Trp Asn Glu Ala
1385 1390 1395

GCC CTG GTG GTG CCA AAG CCT CGT GTT ACC GAG GTT CAT TTC CTG CTC
Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu Leu
1400 1405 1410 1415

AGA AGC CTC TCC GAT AGT GAC CTC CTG GCT ATG AGG CGG CAA GGC CGC 

Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly Arg
1420 1425 1430

TTT CTC TGG GAG ACT TAC TTC TCC ACT GCT GAC AGT ATT TTT AAT ACC
Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn Thr
1435 1440 1445

GTG CTG GCT ATG ATT AGG ACT CGC ATC CAG ATC CCA GCC GCT CCC ATC
Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro Ile
1450 1455 1460

CGG GAA GAG GCG GCA GCT GAG ATC CCC CAC CGT TCA GGC AAG GCG GCT
Arg Glu Glu Ala Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala Ala
1465 1470 1475

GGA ACT GAC CCC AAC ATG GCT GAC AAC GGG GAC CTG GAC CTG GGG CCA
Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly Pro
1480 1485 1490 1495

GTG GAG ACG GAG CCG CCC TAC GCC TCA CCC AGA TAC CTC CGC AAT TTC 

Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Arg Tyr Leu Arg Asn Phe 

1500 1505 1510 

ACT CTG ACT GTC ACT GAC TTT TAC CGC AGC TGG AAC TGT GCT CCA GGG
Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro Gly
1515 1520 1525

CCT TTC CAT CTT TTC CCC CAC ACT CCC TTT GAC CCT GTG TTG CCC TCA
Pro Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro Ser
1530 1535 1540

GAG GCC AAA TTC TTG GGC TCA GGG ACT GGC TTT CGG CCT ATT GGT GGT
Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly Gly
1545 1550 1555

GGA GCT GGG GGT TCT GGC AAG GAA TTT CAG GCA GCG CTT GGA GGC AAT
Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly Asn
1560 1565 1570 1575

GTT CCC CGA GAG CAG TTC ACG GTG GTG ATG TTG ACT TAT GAG CGG GAG 
Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
1580 1585 1590

GAA GTG CTT ATG AAC TCT TTA GAG AGG CTG AAT GGC CTC CCT TAC CTG
Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
1595 1600 1605

AAC AAG GTC GTG GTG GTG TGG AAT TCT CCC AAG CTG CCA TCA GAG GAC
Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
1610 1615 1620

CTT CTG TGG CCT GAC ATT GGC GTT CCC ATC ATG GTG GTC CGT ACT GAG
Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
1625 1630 1635

AAG AAC AGT TTG AAC AAC CGA TTC TTA CCC TGG AAT GAA ATT GAG ACA
Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
1640 1645 1650 1655

GAG GCC ATC CTG TCC ATT GAT GAC GAT GCT CAC CTC CGC CAT GAC GAA
Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
1660 1665 1670

ATC ATG TTT GGG TTC CGG GTG TGG AGA GAA GCT CGG GAC CGC ATC GTG 
Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile Val
1675 1680 1685

GGC TTC CCT GGC CGT TAC CAC GCA TGG GAC ATC CCC CAT CAG TCC TGG
Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
1690 1695 1700

CTC TAC AAC TCC AAC TAC TCC TGT GAG CTG TCC ATG GTG CTG ACA GGT
Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
1705 1710 1715

GCT GCC TTC TTT CAC AAG TAT TAT GCC TAC CTG TAT TCT TAT GTG ATG
Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
1720 1725 1730 1735

CCC CAG GCC ATC CGG GAC ATG GTG GAT GAA TAC ATC AAC TGT GAG GAC
Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
1740 1745 1750

ATT GCC ATG AAC TTC CTT GTC TCC CAC ATC ACT CGG AAG CCC CCC ATC 

Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
1755 1760 1765

AAG GTG ACC TCA CGG TGG ACA TTC CGA TGC CCA GGA TGC CCT CAG GCC
Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
1770 1775 1780

CTG TCT CAT GAT GAC TCC CAC TTC CAC GAG CGG CAC AAG TGC ATC AAC
Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
1785 1790 1795

TTC GTG pjn G GTG TAC GGC TAC ATG CCC CTC CTG TAC ACG CAG TTC
Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
1800 1805 1810 1815

AGG GTG GAT TCT GTG CTC TTC AAG ACA CGC CTG CCC CAT GAC AAG ACC
Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
1820 1825 1830

AAG TGC TTC AAG TTC ATC TAGGGGCAGC GCACGGTCTG GGGAAGAGGA
Lys Cys Phe Lys Phe Ile
1835

TGAGCAGAGG GAGGAAGATG GCTCCCAAGG TTCCTAGGCA TTGCAGGACC TTGGGCACAT 3440

CTGCTGGTGG GTGGCCCAGA GCCTCTGCTG GAAGGGGCAG CAGGAGGAGT GGAAGGAAAC 3500

CGCTGCCTTT ATCTTGAAGT CAGCCACACT GGGCCTGGAG CCCTGGGCGG AGTCCCCGGG 3560

GTTCCCCACA CAGGGCACTG ACTGATAGCT TACACTGAGG ACTGTGGCGA CTCTGCAGAG 3620

TCACTCACAC CGTTCGTACG CCCAGGACAG CTGGTTCGTG GTTTTTACAT TCAATAACAA 3680

CTATTATGAT TATTTAAAAA GAGAAAGTTT CAGATTTGCC ATTCAAGGCT TATTTATATA

TATGTGTGTG TATATAAATA CATGCACACA CTTGCATACA TATATATTTT TGGCTGGGGG

AGTGTGAGTT TTGCCTTTCT AAGGGAGGGA CCGCGCAGGC TCCTTTGTTC TGTATTCTGG

CGGAGATGGG TCCTGGCCTT GTGTCACTGG CTTATCCTTA AAGATCATCT CCCATCCTCC

CCAGCGCCAT CTGTGTGCAG CAACCAGAAA GGGATGAACT TGGCCCTCTT GCGGGCCTGG

ACAAGGTCTC TTCCTTACCC TTTCTGTTGC CAGTCAGCAA CCTGTAACTC ACATTCTCTT

CCCAGTGAAT CCCTGGGAGC GCCTGACCCT GGTGGGCTGT TCAGCTTCCT GCTGCTGGGG 4100

CCAGCGATTT TTGAGGATTT ATCTTTAGGC CAGGCTTGCC TCCGTACTTA TCCCTGCTCT 4160

CCCATTTCTC TCTTGTTTGA GAGAGAATGA GGAAGCAAAG AGTGAGAAAG AATAGGGGCT 4220

GAAGACGCCA CTCCCAGATG GCTCTTTCTA TCCTGCTCTT CTGTTGAAAC ACACGTGCTG 4280

TGGGCCTCAG GCGTTTCTGA AGTGCTCTTT CTTGGATTGG ACAGGAGATC AGCAGCGTGC 4340

ACATCTGCTG TGGTCTGAAG TGGTTTGCAG GTCAGCCTCC TCTCCCTAGT GTAGAGCAAG 4400

CCAGTGTCCT TCGAGGAACC CACCCGGCTG GCCGGGAAGT TTTACAGCAA GGCGCCTGCC 4460

TTGGGATAAT TCCTTGGTGA AATTCACCTT CCCCCCGCCT CTGTCTGGAG CCCCATCCTG 4520

TGTTATCTGT GGTTTTTGGA CCCCTAATGT CAGCTTGGCT GTAGGACTCC CCGAGGTTTG 4580

GTATGTGCTA GAACAATGGG AGGCTGTGAT TTGCTGTGTA AGCTCACATC CAGCCTTGGA 4640

ATCTAACGGG CATTCACAAC CCGAGTTACC ACTTTCCACT CCCTGCTTAG GATTCTGTTC 4700

CCTGGGCTGA AACTGAAATA AGCTAATTTT TTGGGTCACG GTGGCAGTAG GGGAACCTAG 4760

GAGGGTGTGA GTGGCATTTG TCAGGGATTT AGCCCATGAC GTGTTTCTTG AACCCTACTT 4820

TCTGGAAGTG GAGTTGACTC TGGAAGTTTT CTAGCAACTG AACAAAAGCT CAGGTTTGTC

CTGGTCATGC ACATGCCTTA AGCCAGTTCC GTCTTCCCTA GACCTTGGCA TCCTGTGCTT

CTATTTCTTG GAATACGTTC TCCTCTGACC TGCCTGTACC ACGTGGGTCC TCTTCAAGTA

CTGTTTTGAA GCTGGGCTCT TTTGTGTAGC TCCCACCCAC CTGTAGGGCT AGCTCGGCTT

AAGGGAACTC TCCCCATTGG CAAACCGGAC CCGGCCGCCG CCAGGACTGT GTTTCCAAAG

GTTCCCCGCC CCCAACCCCA GCATCAGCCT GTAGCTCCCC TGCTGAGGCA GTGTGGTTAT

GTTCCCAGCA GTGGGGGTCA GACGCCCTTC CTCAGAACTT TCTAGTTGCC CTCTACCTGA

CTCCTGACTT GTATTCCTTT TAGCAGTAGC CTTCTTCCCT CGGGGAGCCA AAGAGTGTGG

TGTGTGGCGC TATATTGTGG CTGCTATTTC ATCTGGTTTC TTTTAATGTG AGGAACTCAC

ATACTGACTT CAGTGGGACT CGGTGAGCCG GGGCCGTCTG TGTGGTGGGA CCCCCTTTAG

CGGGACTCAG TGAGCTGGGG CCGTCTGTGT GGTGGAGCCA GGGCCTCTCC CTTTAGTGGA

GCCAGGTTGT CGGGCCCCGA ATGTCACTGG TGGATCTAAG AAGGGCTGAG TGGTCTGACA

CCAAAACATG CCGCAGGGAG GGCTGTGGTG CCGGTGCTTC CAACAAGGAC AGCCCTCCTT

GACCCTGAAA GGAACACTGG CTTGAAGGAC TGCAGACAGG CTCTGAGGGG CACGCCCTCC 5660

TCAGCGAGAG GCAGCAAGGT GGCCACAGTG TCACTGGTCA GGTGCTTCTC ACCACGGGAA

AGCCGCCGAC CTGTGACTCG CTTGAGATGG GAAAGCGGCG CCACAGACCC CGGGTCTCCT

TGGCTGTCTG TGGGCCGCCC CTGGCCACCT TGTCCTGGCT CGCAGGGTGC AGGAGCGCCT

CGTTCTCTGG GTGGCCGGCT TGCTGCTCCG GTTTGGGCTG TCTTACCATA ACACCGTCCC

AGGGCTCTGC AGGCCACTGT GAGCGCTGGC TCCCTGGGCA GTGCTCCTCC GTGTGGACTG 5960

TGCCTCAGGC CAGGGCTCAC CAGCTGGGGT CCTGTCCGGA AGGATGGGAT CTTTCTGGGA 6020

GCTGCGCCGG ACAGAGTGGG GAGCTCCTAG TTTGTGGGGG GAAGCTTTGA TATCCATGCC 6080

ACGTCCATCC ACCCCACCCC TTTTCGTCAC GAGCACAATG GTCTTACATT GGATTTTTGT 6140

AAAAAAATAA AAATAAATGG AGACTTTAAC TC 6172

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 919 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Gly Tyr Thr Met Leu Arg Asn Gly Gly Ala Gly Asn Gly Gly 
1 5 10 15

Gln Thr Cys Met Leu Arg Trp Ser Asn Arg Ile Arg Leu Thr Trp Leu
20 25 30

Ser Phe Thr Leu Phe Val Ile Leu Val Phe Phe Pro Leu Ile Ala His
35 40 45

Tyr Tyr Leu Thr Thr Leu Asp Glu Ala Asp Glu Ala Gly Lys Arg Ile
50 55 60

Phe Gly Pro Arg Val Gly Asn Glu Leu Cys Glu Val Lys His Val Leu
65 70 75 80

Asp Leu Cys Arg Ile Arg Glu Ser Val Ser Glu Glu Leu Leu Gln Leu
85 90 95

Glu Ala Lys Arg Gln Glu Leu Asn Ser Glu Ile Ala Lys Leu Asn Leu
100 105 110

Lys Ile Glu Ala Cys Lys Lys Ser Ile Glu Asn Ala Lys Gln Asp Leu
115 120 125

Leu Gln Leu Lys Asn Val Ile Ser Gln Thr Glu His Ser Tyr Lys Glu
130 135 140

Leu Met Ala Gln Asn Gln Pro Lys Leu Ser Leu Pro Ile Arg Leu Leu
145 150 155 160

Pro Glu Lys Asp Asp Ala Gly Leu Pro Pro Pro Lys Ala Thr Arg Gly
165 170 175

Cys Arg Leu His Asn Cys Phe Asp Tyr Ser Arg Cys Pro Leu Thr Ser
180 185 190

Gly Phe Pro Val Tyr Val Tyr Asp Ser Asp Gln Phe Val Phe Gly Ser
195 200 205

Tyr Leu Asp Pro Leu Val Lys Gln Ala Phe Gln Ala Thr Ala Arg Ala
210 215 220

Asn Val Tyr Val Thr Glu Asn Ala Asp Ile Ala Cys Leu Tyr Val Ile
225 230 235 240

Leu Val Gly Glu Met Gln Glu Pro Val Val Leu Arg Pro Ala Glu Leu
245 250 255

Glu Lys Gln Leu Tyr Ser Leu Pro His Trp Arg Thr Asp Gly His Asn
260 265 270

His Val Ile Ile Asn Leu Ser Arg Lys Ser Asp Thr Gln Asn Leu Leu
275 280 285

Tyr Asn Val Ser Thr Gly Arg Ala Met Val Ala Gln Ser Thr Phe Tyr
290 295 300

Thr Val Gln Tyr Arg Pro Gly Phe Asp Leu Val Val Ser Pro Leu Val
305 310 315 320

His Ala Met Ser Glu Pro Asn Phe Met Glu Ile Pro Pro Gln Val Pro
325 330 335



Val Lys Arg Lys Tyr Leu Phe Thr Phe Gln Gly Glu Lys Ile Glu Ser
340 345 350

Leu Arg Ser Ser Leu Gln Glu Ala Arg Ser Phe Glu Glu Glu Met Glu
355 360 365

Gly Asp Pro Pro Ala Asp Tyr Asp Asp Arg Ile Ile Ala Thr Leu Lys
370 375 380

Ala Val Gln Asp Ser Lys Leu Asp Gln Val Leu Val Glu Phe Thr Cys
385 390 395 400

Lys Asn Gln Pro Lys Pro Ser Leu Pro Thr Glu Trp Ala Leu Cys Gly
405 410 415

Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr Phe Ala Leu
420 425 430

Ile Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser Gly Cys Ala
435 440 445

Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro Val Val Leu
450 455 460

Gly Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln Trp Asn Glu
465 470 475 480

Ala Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val His Phe Leu
485 490 495

Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg Arg Gln Gly
500 505 510

Arg Phe Leu Trp Glu Thr Tyr Phe Ser Thr Ala Asp Ser Ile Phe Asn
515 520 525

Thr Val Leu Ala Met Ile Arg Thr Arg Ile Gln Ile Pro Ala Ala Pro
530 535 540

Ile Arg Glu Glu Ala Ala Ala Glu Ile Pro His Arg Ser Gly Lys Ala
545 550 555 560

Ala Gly Thr Asp Pro Asn Met Ala Asp Asn Gly Asp Leu Asp Leu Gly
565 570 575



Pro Val Glu Thr Glu Pro Pro Tyr Ala Ser Pro Arg Tyr Leu Arg Asn
580 585 590

Phe Thr Leu Thr Val Thr Asp Phe Tyr Arg Ser Trp Asn Cys Ala Pro
595 600 605

Gly Pro Phe His Leu Phe Pro His Thr Pro Phe Asp Pro Val Leu Pro
610 615 620

Ser Glu Ala Lys Phe Leu Gly Ser Gly Thr Gly Phe Arg Pro Ile Gly
625 630 635 640

Gly Gly Ala Gly Gly Ser Gly Lys Glu Phe Gln Ala Ala Leu Gly Gly
645 650 655

Asn Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg
660 665 670

Glu Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr
675 680 685

Leu Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu
690 695 700

Asp Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr
705 710 715 720

Glu Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu
725 730 735

Thr Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp
740 745 750

Glu Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile
755 760 765

Val Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser
770 775 780

Trp Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr
785 790 795 800

Gly Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val
805 810 815

Met Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu
820 825 830

Asp Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro
835 840 845

Ile Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln
850 855 860

Ala Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile
865 870 875 880

Asn Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln
885 890 895

Phe Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys
900 905 910

Thr Lys Cys Phe Lys Phe Ile
915

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Leu Cys Gly Glu Arg Glu Asp Arg Leu Glu Leu Leu Lys Leu Ser Thr
1 5 10 15

Phe Ala Leu Ile Ile Thr Pro Gly Asp Pro Arg Leu Val Ile Ser Ser
20 25 30

Gly Cys Ala Thr Arg Leu Phe Glu Ala Leu Glu Val Gly Ala Val Pro
35 40 45

Val Val Leu Gly Glu Gln Val Gln Leu Pro Tyr Gln Asp Met Leu Gln
50 55 60

Trp Asn Glu Ala Ala Leu Val Val Pro Lys Pro Arg Val Thr Glu Val
65 70 75 80

His Phe Leu Leu Arg Ser Leu Ser Asp Ser Asp Leu Leu Ala Met Arg
85 90 95

Arg Gln Gly Arg Phe Leu Trp Glu Thr Tyr Phe Pro Thr Ala Asp Ser
100 105 110

Ile Phe Asn Thr Val Leu Ala Met Ile Arg Thr Arg Ile
115 120 125

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Arg Cys His Lys His Gln Val Phe Asp Tyr Pro Gln Val Leu Gln Glu
1 5 10 15

Ala Thr Phe Cys Val Val Leu Arg Gly Ala Arg Leu Gly Gln Ala Val
20 25 30

Leu Ser Asp Val Leu Gln Ala Gly Cys Val Pro Val Val Ile Ala Asp
35 40 45

Ser Tyr Ile Leu Pro Phe Ser Glu Val Leu Asp Trp Lys Arg Ala Ser
50 55 60

Val Val Val Pro Glu Glu Lys Met Ser Asp Val Tyr Ser Ile Leu Gln
65 70 75 80

Ser Ile Pro Gln Arg Gln Ile Glu Glu Met Gln Arg Gln Ala Arg Trp
85 90 95

Phe Trp Glu Ala Tyr Phe Gln Ser Ile Lys Ala Ile Ala Leu Ala Thr
100 105 110

Leu Gln Ile Ile Asn Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 124 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Arg Cys Asp Arg Asp Asn Thr Glu Tyr Glu Lys Tyr Asp Tyr Arg Glu
1 5 10 15

Met Leu His Asn Ala Thr Phe Cys Leu Val Pro Arg Gly Arg Arg Leu
20 25 30

Gly Ser Phe Arg Phe Leu Glu Ala Leu Gln Ala Ala Cys Val Pro Val
35 40 45

Met Leu Ser Asn Gly Trp Glu Leu Pro Phe Ser Glu Val Ile Asn Trp
50 55 60

Asn Gln Ala Ala Val Ile Gly Asp Glu Arg Leu Leu Leu Gln Ile Pro
65 70 75 80

Ser Thr Ile Arg Ser Ile His Gln Asp Lys Ile Leu Ala Leu Arg Gln
85 90 95

Gln Thr Gln Phe Leu Trp Glu Ala Tyr Phe Ser Ser Val Glu Lys Ile
100 105 110

Val Leu Thr Thr Leu Glu Ile Ile Gln Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Arg Cys Glu Gln Asp Pro Gly Pro Gly Gln Thr Gln Arg Gln Glu Thr
1 5 10 15

Leu Pro Asn Ala Thr Phe Cys Leu Ile Ser Gly His Arg Pro Glu Ala
20 25 30

Ala Ser Arg Phe Leu Gln Ala Leu Gln Ala Gly Cys Ile Pro Val Leu
35 40 45

Leu Ser Pro Arg Trp Glu Leu Pro Phe Ser Glu Val Ile Asp Trp Thr
50 55 60

Lys Ala Ala Ile Val Ala Asp Glu Arg Leu Pro Leu Gln Val Leu Ala
65 70 75 80

Ala Leu Gln Glu Met Ser Pro Ala Arg Val Leu Ala Leu Arg Gln Gln
85 90 95

Thr Gln Phe Leu Trp Asp Ala Tyr Phe Ser Ser Val Glu Lys Val Ile
100 105 110

His Thr Thr Leu Glu Val Ile Gln Asp Arg Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 121 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Lys Cys Ser Gln Glu Asn Cys Ser Leu Glu Arg Arg Arg Gln Leu Ile
1 5 10 15

Gly Ser Ser Thr Phe Cys Phe Leu Leu Pro Ser Glu Met Phe Phe Gln
20 25 30

Asp Phe Leu Ser Ser Leu Gln Leu Gly Cys Ile Pro Ile Leu Leu Ser
35 40 45

Asn Ser Gln Leu Leu Pro Phe Gln Asp Leu Ile Asp Trp Arg Arg Ala
50 55 60

Thr Tyr Arg Leu Pro Leu Ala Arg Leu Pro Glu Ala His Phe Ile Val
65 70 75 80

Gln Ser Phe Glu Ile Ser Asp Ile Ile Glu Met Arg Arg Val Gly Arg
85 90 95

Leu Phe Tyr Glu Thr Tyr Leu Ala Asp Arg His Leu Leu Ala Arg Ser
100 105 110

Leu Leu Ala Ala Leu Arg Tyr Lys Leu
115 120

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 262 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Val Pro Arg Glu Gln Phe Thr Val Val Met Leu Thr Tyr Glu Arg Glu
1 5 10 15

Glu Val Leu Met Asn Ser Leu Glu Arg Leu Asn Gly Leu Pro Tyr Leu
20 25 30

Asn Lys Val Val Val Val Trp Asn Ser Pro Lys Leu Pro Ser Glu Asp
35 40 45

Leu Leu Trp Pro Asp Ile Gly Val Pro Ile Met Val Val Arg Thr Glu
50 55 60

Lys Asn Ser Leu Asn Asn Arg Phe Leu Pro Trp Asn Glu Ile Glu Thr
65 70 75 80

Glu Ala Ile Leu Ser Ile Asp Asp Asp Ala His Leu Arg His Asp Glu
85 90 95

Ile Met Phe Gly Phe Arg Val Trp Arg Glu Ala Arg Asp Arg Ile Val
100 105 110

Gly Phe Pro Gly Arg Tyr His Ala Trp Asp Ile Pro His Gln Ser Trp
115 120 125

Leu Tyr Asn Ser Asn Tyr Ser Cys Glu Leu Ser Met Val Leu Thr Gly
130 135 140

Ala Ala Phe Phe His Lys Tyr Tyr Ala Tyr Leu Tyr Ser Tyr Val Met
145 150 155 160

Pro Gln Ala Ile Arg Asp Met Val Asp Glu Tyr Ile Asn Cys Glu Asp
165 170 175

Ile Ala Met Asn Phe Leu Val Ser His Ile Thr Arg Lys Pro Pro Ile
180 185 190

Lys Val Thr Ser Arg Trp Thr Phe Arg Cys Pro Gly Cys Pro Gln Ala
195 200 205

Leu Ser His Asp Asp Ser His Phe His Glu Arg His Lys Cys Ile Asn
210 215 220

Phe Phe Val Lys Val Tyr Gly Tyr Met Pro Leu Leu Tyr Thr Gln Phe
225 230 235 240

Arg Val Asp Ser Val Leu Phe Lys Thr Arg Leu Pro His Asp Lys Thr
245 250 255

Lys Cys Phe Lys Phe Ile
260

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 269 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Pro Gln Ser Gln Gly Phe Thr Gln Ile Val Leu Thr Tyr Asp Arg Val
1 5 10 15

Glu Ser Leu Phe Arg Val Ile Thr Glu Val Ser Lys Val Pro Ser Leu
20 25 30

Ser Lys Leu Leu Val Val Trp Asn Asn Gln Asn Lys Asn Pro Pro Glu
35 40 45

Asp Ser Leu Trp Pro Lys Ile Arg Val Pro Leu Lys Val Val Arg Thr
50 55 60

Ala Glu Asn Lys Leu Ser Asn Arg Phe Phe Pro Tyr Asp Glu Ile Glu
65 70 75 80

Thr Glu Ala Val Leu Ala Ile Asp Asp Asp Ile Ile Met Leu Thr Ser
85 90 95

Asp Glu Leu Gln Phe Gly Tyr Glu Val Trp Arg Glu Phe Pro Asp Arg
100 105 110

Leu Val Gly Tyr Pro Gly Arg Leu His Leu Trp Asp His Glu Ala Met
115 120 125

Asn Lys Trp Lys Tyr Glu Ser Glu Trp Thr Asn Glu Val Ser Met Val
130 135 140

Leu Thr Gly Ala Ala Phe Tyr His Lys Tyr Phe Asn Tyr Leu Tyr Thr
145 150 155 160

Lys Met Pro Gly Asp Ile Lys Asn Trp Val Asp Ala His Met Asn Cys
165 170 175

Tyr Glu Asp Ile Ala Met Asn Phe Leu Val Ala Asn Val Thr Gly Lys
180 185 190

Ala Val Ile Lys Val Thr Pro Arg Lys Lys Phe Lys Cys Pro Glu Cys
195 200 205

Thr Ala Ile Asp Gly Leu Ser Leu Asp Gln Thr His Met Val Glu Arg
210 215 220

Ser Glu Cys Ile Asn Lys Phe Ala Ser Val Phe Gly Thr Met Pro Leu
225 230 235 240

Lys Val Val Glu His Arg Ala Asp Pro Val Leu Tyr Lys Asp Asp Phe
245 250 255

Pro Glu Lys Leu Lys Ser Phe Pro Asn Ile Gly Ser Leu
260 265

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 270 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Pro Pro Ser Lys Phe Thr Ala Val Ile His Ala Val Thr Pro Leu Val
1 5 10 15

Ser Gln Ser Gln Pro Val Leu Lys Leu Leu Val Ala Ala Ala Lys Ser
20 25 30

Gln Tyr Cys Ala Gln Ile Ile Val Leu Trp Asn Cys Asp Lys Pro Leu
35 40 45

Pro Ala Lys His Arg Trp Pro Ala Thr Ala Val Pro Val Val Val Ile
50 55 60

Glu Gly Glu Ser Lys Val Met Ser Ser Arg Phe Leu Pro Tyr Asp Asn
65 70 75 80

Ile Ile Thr Asp Ala Val Leu Ser Leu Asp Glu Asp Thr Val Leu Ser
85 90 95

Thr Thr Glu Val Asp Phe Ala Phe Thr Val Trp Gln Ser Phe Pro Glu
100 105 110

Arg Ile Val Gly Tyr Pro Ala Arg Ser His Phe Trp Asp Asn Ser Lys
115 120 125

Glu Arg Trp Gly Tyr Thr Ser Lys Trp Thr Asn Asp Tyr Ser Met Val
130 135 140

Leu Thr Gly Ala Ala Ile Tyr His Lys Tyr Tyr His Tyr Leu Tyr Ser
145 150 155 160

His Tyr Leu Pro Ala Ser Leu Lys Asn Met Val Asp Gln Leu Ala Asn
165 170 175

Cys Glu Asp Ile Leu Met Asn Phe Leu Val Ser Ala Val Thr Lys Leu
180 185 190

Pro Pro Ile Lys Val Thr Gln Lys Lys Gln Tyr Lys Glu Thr Met Met
195 200 205

Gly Gln Thr Ser Arg Ala Ser Arg Trp Ala Asp Pro Asp His Phe Ala
210 215 220

Gln Arg Gln Ser Cys Met Asn Thr Phe Ala Ser Trp Phe Gly Tyr Met
225 230 235 240

Pro Leu Ile His Ser Gln Met Arg Leu Asp Pro Val Leu Lys Asp Gln
245 250 255

Val Ser Ile Leu Arg Lys Lys Tyr Arg Asp Ile Glu Arg Leu
260 265 270

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 262 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Pro Glu Gly Arg Phe Ser Ala Leu Ile Trp Val Gly Pro Pro Gly Gln
1 5 10 15

Pro Pro Leu Lys Leu Ile Gln Ala Val Ala Gly Ser Gln His Cys Ala
20 25 30

Gln Ile Leu Val Leu Trp Ser Asn Glu Arg Pro Leu Pro Ser Arg Trp
35 40 45

Pro Glu Thr Ala Val Pro Leu Thr Val Ile Asp Gly His Arg Lys Val
50 55 60

Ser Asp Arg Phe Tyr Pro Tyr Ser Thr Ile Arg Thr Asp Ala Ile Leu
65 70 75 80

Ser Leu Asp Ala Arg Ser Ser Leu Ser Thr Ser Glu Val Asp Phe Ala
85 90 95

Phe Leu Val Trp Gln Ser Phe Pro Glu Arg Met Val Gly Phe Leu Thr
100 105 110

Ser Ser His Phe Trp Asp Glu Ala His Gly Gly Trp Gly Tyr Thr Ala
115 120 125

Glu Arg Thr Asn Glu Phe Ser Met Val Leu Thr Thr Ala Ala Phe Tyr
130 135 140

His Arg Tyr Tyr His Thr Leu Phe Thr His Ser Leu Pro Lys Ala Leu
145 150 155 160

Arg Thr Leu Ala Asp Glu Ala Pro Thr Cys Val Asp Val Leu Met Asn
165 170 175

Phe Ile Val Ala Ala Val Thr Lys Leu Pro Pro Ile Lys Val Pro Tyr
180 185 190

Gly Lys Gln Arg Gln Glu Ala Ala Pro Leu Ala Pro Gly Gly Pro Gly
195 200 205

Pro Arg Pro Lys Pro Pro Ala Pro Ala Pro Asp Cys Ile Asn Gln Ile
210 215 220

Ala Ala Ala Phe Gly His Met Pro Leu Leu Ser Ser Arg Leu Arg Leu
225 230 235 240

Asp Pro Val Leu Phe Lys Asp Pro Val Ser Val Gln Arg Lys Lys Tyr
245 250 255



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 270 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Ser Thr Met Asp Ser Phe Thr Leu Ile Met Gln Thr Tyr Asn Arg Thr
1 5 10 15

Asp Leu Leu Leu Lys Leu Leu Asn His Tyr Gln Ala Val Pro Asn Leu
20 25 30

His Lys Val Ile Val Val Trp Asn Asn Ile Gly Glu Lys Ala Pro Asp
35 40 45

Glu Leu Trp Asn Ser Leu Gly Pro His Pro Ile Pro Val Ile Phe Lys
50 55 60

Gln Gln Thr Ala Asn Arg Met Arg Asn Arg Leu Gln Val Phe Pro Glu
65 70 75 80

Leu Glu Thr Asn Ala Val Leu Met Val Asp Asp Asp Thr Leu Ile Ser
85 90 95

Thr Pro Asp Leu Val Phe Ala Phe Ser Val Trp Gln Gln Phe Pro Asp
100 105 110

Gln Ile Val Gly Phe Val Pro Arg Lys His Val Ser Thr Ser Ser Gly
115 120 125

Ile Tyr Ser Tyr Gly Ser Phe Glu Met Gln Ala Pro Gly Ser Gly Asn
130 135 140

Gly Asp Gln Tyr Ser Met Val Leu Ile Gly Ala Ser Phe Phe Asn Ser
145 150 155 160

Lys Tyr Leu Glu Leu Phe Gln Arg Gln Pro Ala Ala Val His Ala Leu
165 170 175

Ile Asp Asp Thr Gln Asn Cys Asp Asp Ile Ala Met Asn Phe Ile Ile
180 185 190

Ala Lys His Ile Gly Lys Thr Ser Gly Ile Phe Val Lys Pro Val Asn
195 200 205

Met Asp Asn Leu Glu Lys Glu Thr Asn Ser Gly Tyr Ser Gly Met Trp
210 215 220

His Arg Ala Glu His Ala Leu Gln Arg Ser Tyr Cys Ile Asn Lys Leu
225 230 235 240

Val Asn Ile Tyr Asp Ser Met Pro Leu Arg Tyr Ser Asn Ile Met Ile
245 250 255

Ser Gln Phe Gly Phe Pro Tyr Ala Asn Tyr Lys Arg Lys Ile
260 265 270

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 259 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Arg Gln Arg Glu Gln Phe Thr Val Val Leu Leu Thr Tyr Glu Arg Asp
1 5 10 15

Ala Val Leu Thr Gly Ala Leu Glu Arg Leu His Gln Leu Pro Tyr Leu
20 25 30

Asn Lys Ile Ile Val Val Trp Asn Asn Val Asn Arg Asp Pro Pro Asp
35 40 45

Ser Trp Pro Ser Leu His Ile Pro Val Glu Phe Ile Arg Val Ala Glu
50 55 60

Asn Asn Leu Asn Asn Arg Phe Val Pro Trp Asp Arg Ile Glu Thr Glu
65 70 75 80

Ala Val Leu Ser Leu Asp Asp Asp Ile Asp Leu Met Gln Gln Glu Ile
85 90 95

Ile Leu Ala Phe Arg Val Trp Arg Glu Asn Arg Asp Arg Ile Val Gly
100 105 110

Phe Pro Ala Arg His His Ala Arg Tyr Gly Asp Ser Met Phe Tyr Asn
115 120 125

Ser Asn His Thr Cys Gln Met Ser Met Ile Leu Thr Gly Ala Ala Phe
130 135 140

Ile His Lys Asn Tyr Leu Thr Ala Tyr Thr Tyr Glu Met Pro Ala Glu
145 150 155 160

Ile Arg Glu His Val Asn Ser Ile Lys Asn Cys Glu Asp Ile Ala Met
165 170 175

Asn Tyr Leu Val Ser His Leu Thr Arg Lys Pro Pro Ile Lys Thr Thr
180 185 190

Ser Arg Trp Thr Leu Lys Cys Pro Thr Cys Thr Glu Ser Leu Tyr Lys
195 200 205

Glu Gly Thr His Phe Glu Lys Arg His Glu Cys Met Arg Leu Phe Thr
210 215 220

Lys Ile Tyr Gly Tyr Asn Pro Leu Lys Phe Ser Gln Phe Arg Ala Asp
225 230 235 240

Ser Ile Leu Phe Lys Thr Arg Leu Pro Gln Asn His Gln Lys Cys Phe
245 250 255

Lys Tyr Val

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

TTATGGCGAG TGACCCGACG TG

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

TTGCTAAAGT GAAGGAAGTT GG

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

ACCCGACGTG ATCTGG

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AAGAGCTCCT GCAGCTGG

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

TTCTCGTTGC CCTCTCAC

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATCATCAATC TGTCACG

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

ACTACGATGA CCGGATC

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

TTCCCTACCA GGACATGC

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

AACATGGCTG ACAACG

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TATTGGTGGT GGAGCTGG

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

AATCCAGCCA TGGTCTCCTT GG

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

AGTCGATGCC ATTATTACCA GC

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

TTCCTTCCTC ATCACAG

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

AGGTCTGTGT ATGCACTTGT G

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

AGTCGATGCC ATTATTACCA GC

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

TTCAAGGGTG TGGAGAG

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

TTGGCTGAAA GCCAACAACC TG

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

AACATGCACG CATCCACAGC

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

TTGTAACACA GCATGTGG

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

GGTTCTGTCA GTATTAGCTG GG

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

TTCCTCCCTC TGCTCATCCT C

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

TTCCCACTCT GTCTCTC



